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METHOD OF TRANSPOSON-MEDIATED MUTAGENESIS IN THE 
NEMATODE CAENORHABDITIS ELEGANS 

1. FIELD OF THE INVENTION 

The present invention relates to methods for generating and identifying mutations 
5 in the genome of the nematode Caenorhabditis elegans (hereinafter "C elegans"). More 
specifically, the present invention relates to a transgene construct for expression in C 
elegans, and to methods for regulating mobilization of heterologous or endogenous 
transposons in the C elegans genome, inserting a heterologous DNA sequence into C 
elegans germline DNA, and engineering mutations into the C. elegans genome. 

10 

2. TECHNICAL BACKGROUND 

The use of model genetic systems had its beginnings in the earliest days of the 
science of genetics and, as a result of the tremendous value of such systems in 
understanding genetic phenomena, continues in the present. Researchers often use in their 

15 work organisms which have short life spans, limited space requirements, and relatively 
small genomes. Specifically, certain species of worms, fruit flies, and yeast cells are 
common subjects of research. Using such organisms, researchers may learn the function 
of the various genes found within the DNA of the organisms. One commonly used 
method is to generate mutations in the genome of an organism, followed by selection or 

20 screening for those mutations which confer a specific property or characteristic to the 
organism. These mutational studies suggest probable functions for the genes in which 
mutations occur. Mutations often occur when a gene is changed in such a way that the 
product of the gene is altered or nonfunctional. 

A common method for generating mutations uses transposable elements. 

25 Transposable elements are segments of DNA which have the ability to "hop" — that is, to 
be excised from their initial position in the DNA and move to a new location. In doing 
this, a transposable element, also known as a transposon, may insert into some portion of 
a gene, thus disrupting or even changing the function of the gene. Further, additional 
mutations may be created by remobilizing the transposon. Since this remobilization often 

30 occurs imperfectly, changes are created in the DNA sequence, leaving the final sequence 
different from the original sequence. See J.D. Watson, J. Witkowski, M. Gilman, and M. 
Zoller, Recombinant DNA 175-190, 439-440 2d. cd. (1996). 
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The P element, a transposable element found in the genes of fruit flies, see, e.g., 
A. C. Spradling, G. M. Rubin, Science 218, 341 (1982); J.D. Watson, J. Witkowski, M. 
Gilman, and M. Zoller, Recombinant DNA 175, 177 2d. ed. (1996). has been an 
enormously useful tool in Drosophila genetic analysis for two reasons. First, these 
transposons have been used for insertional mutagenesis. Mutagenic insertions constitute 
5 molecular tags that are used to rapidly clone the mutated gene. L. Cooley, R. Kelley, A. 
Spradling, Science 239, 1121 (1988). Particularly helpful in such studies is the presence 
of strains that lack any copies of the transposon. Second, P elements are used to introduce 
single copies of foreign sequences into the host genome. This feature is particularly useful 
for the rapid identification of gene expression patterns by using enhancer traps. H. J. 

10 Bellen, et al., Genes Dev. 3, 1288 (1989). The availability of such techniques would be 
particularly advantageous in studies of the genome of the nematode C. elegans. 

C. elegans is a model system in which genetics can be used to identify genes and 
biological pathways which are conserved between nematodes and vertebrates, and which 
thus constitute potential targets for the treatment of various diseases. C. elegans is 

15 particularly advantageous for genetic studies because it is easily propagated and because 
the genetic and physical maps of its genome are well-characterized. W. B. Wood, 
Introduction to C. elegans Biology (1988). The characterization of gene structure in 
C. elegans has become routine, largely through the efforts of the C. elegans genome 
project. The workers involved in this effort have cloned the entire genome into cosmid or 

20 YAC vectors and have completed the genomic sequence. C elegans Sequencing 

Consortium, Science 282:2012-2018 (1998); A. Coulson et al., Proc Natl Acad Sci USA 
83:7821-7825 (1986); A. Coulson et al., Bioessays 13:413-417 (1991); R. Wilson et al., 
Nature 368:32-38 (1994). 

Standard mutagenesis in C. elegans employs chemical mutagens. After generation 

25 of a mutant, identification of the gene requires time-consuming genetic mapping followed 
by single gene rescue. Alternatively, transposon-based mutagenesis has been attempted 
using mutant backgrounds like mut-2, but efficiency of transposition is low and not 
specific for a defined transposon class. Further, since the genomes of all C. elegans 
strains contain transposons, it is very difficult to identify relevant insertions. Thus, utility 

30 of native transposons for regulated transposition in C. elegans is limited. First, all strains 
contain multiple copies of these transposons and thus new insertions do not provide 
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unique tags. Second, mutator strains tend to activate the transposition of several classes of 
transposons, so that the type of transposon associated with a particular mutation is not 
known. Third, transposition is not regulated and the transposon tag can be lost by excision 
in subsequent generations. Fourth, attempts to regulate transposase expression have failed 
because expression of transgenes in the germline of C elegans is very difficult. Although 
5 one could theoretically regulate the transposition of a specific element by expressing the 
transposase under the control of a germline-specific promoter, transgenic arrays are 
typically silenced in the germline. W. G. Kelly, S. Xu, M. K. Montgomery, A. Fire, 
Genetics 146, 227(1997) 

Another problem in this field is the difficulty of expressing DNA in the C. elegans 

10 germline. Current methods, see f e.g., W. G. Kelly et al, Genetics 146:227-238 (1997), 
are not adequate. First, current methods for expressing foreign DNA in the C. elegans 
germline do not work for all genes. Second, expression of genes introduced using these 
methods declines over time. 

Finally, introduction of single copy DNA is not possible using existing 

15 technology. 

From the foregoing, it will be appreciated that it would be a significant 
advancement in the art to provide methods that allow regulated expression of foreign 
DNA in the C elegans germline. It would be a further advancement to provide methods 
that allow germline expression of a transgene in C. elegans. It would be a further 

20 advancement in the art to provide regulated expression of such a transgene in the 
germline, as by regulation using a heat-shock promoter. It would be a further 
advancement to provide methods of regulating the transposition of either endogenous or 
heterologous transposons in C. elegans. Further, it would be an advancement to provide 
transgene constructs to facilitate germline expression of transgenes and regulated 

25 transposition of homologous and heterologous transposons. Such compositions of matter 
and methods are disclosed herein. 

3. BRIEF SUMMARY OF THE INVENTION 

The present invention relates to improved methods for generating and identifying 
30 mutations in C. elegans, and includes methods for introducing heterologous DNA into the 
C. elegans germline and causing its expression. In certain embodiments, a method of the 
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present invention comprises the steps of inserting a transgene construct into the C. 

elegans, wherein the construct comprises a heterologous gene operably linked to a 

promoter and a 3' untranslated region of a gene that is expressed in the C elegans 

germline; and expressing the heterologous gene. In certain embodiments, this method 

further comprises the removal of all bacterial plasmid sequences and repeated sequences 
5 from the DNA to be introduced. In certain preferred embodiments, a promoter that is 

active in the C. elegans germline drives expression of the transgene. In certain especially 

preferred embodiments, the promoter is an inducible promoter. 

The present invention further relates to a transgene construct for expression in C. 

elegans which comprises a heterologous gene operably linked to a promoter and a 3' 
10 untranslated region of a gene expressed in the C elegans genome. In certain 

embodiments, the transgene construct further comprises a promoter that is active in the 

germline of C. elegans or a promoter that is inducible. 

The present invention further relates to methods for generating and identifying 

mutations in C elegans. In one embodiment, a method of the present invention comprises 
1 5 the introduction and expression of a transposase gene to mobilize either endogenous or 

heterologous transposons. In certain preferred embodiments, the transposons are 

endogenous Tc3 transposons. 

In certain other embodiments, the transposons are heterologous transposons, such 

as the Drosophila mariner element. Controlled mobilization of heterologous transposons 
20 allows the generation of mutations, which are tagged by the insertion of the transposon. 

PCR-based techniques permit rapid identification of the transposon insertion that caused 

the mutation. 

The present invention further relates to methods for introducing single copy DNA 
sequences into C elegans. In certain preferred embodiments, a method of the present 

25 invention comprises introducing a transposon comprising a heterologous DNA sequence 
into a C. elegans, introducing a transgene construct comprising a transposase gene 
operably linked to a promoter and a 3' untranslated region of a gene that is expressed in 
the C. elegans germline, and expressing the transposase such that the transposase 
integrates into a C elegans chromosome as a single copy. The transposon may be 

30 engineered to introduce a DNA sequence, such as one that codes for a reporter gene such 
as, for example, a green fluorescent protein. The introduced DNA sequence may also 
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contain FRT/FLP or CRE/LOX recombination sites. Alternatively, the introduced DNA 
sequence may contain polyadenylation sites or transcriptional terminators. 

These and other features and advantages of the present invention will become 
more fully apparent from the following detailed description. 

4. SUMMARY OF THE DRAWINGS 

Figure 1 schematically depicts a method for mutagenesis by controlled 
heterologous transposition. 

Figure 2 depicts the structure of the pJL44 Mosl transposase expression vector. 

Figure 3 schematically depicts a method for identifying sequences flanking the 
Mos 1 insertion site using inverse PCR. 

Figure 4 depicts the sequence (SEQ ID NO.: 23) of an inverse PCR product. 
Nucleotides in capital letters are from the Mosl transposon. The C. elegans flanking 
genomic region is in lower case. It matches the Y47C4.A sequence from chromosome X 
available at the Sanger Centre. See the Sanger Centre web site at 
http://www.sanger.ac.uk/Projects/C_elegans/ . 

Figure 5 depicts the mobilization of Mosl in C. elegans somatic cells. (A) 
Engineering of the Mos transposase encoding sequence. Restriction sites were generated 
at the 5' and 3 f ends of the coding sequence (new sequence is indicated under the original 
sequence). The endogenous polyadenylation signal (boxed) was disrupted and an 
artificial intron was introduced in the coding sequence in order to improve transposase 
expression. See A. Fire, S. W. Harrison, D. Dixon, Gene 93, 189 (1990). (B) Localization 
of Mosl insertions in unc-49 and gpa-2 genes after induction of Mos transposase 
expression in somatic cells. Open triangles: insertion sites; black rectangles: coding 
exons; white rectangle: non coding exonic sequence. Arrows: genomic primers used to 
amplify the insertions. (C) Sequence comparison of 22 insertion sites. Insertion sites are 
oriented relative to the 5' end of the Mosl transposon. Sequences that flank Mosl at the 
right end were identified by PCR. DNA purification and PCR were performed as 
described in H. G. van Luenen, S. D. Colloms, R. H. Plasterk, EmboJ. 12, 2513 (1993). 
The primers in Mosl were 0JL88 (S'-CGCATGCGGCTTACTCAC (SEQ ID NO: 4)) 
first PCR; and oJL89 (S'-GGCCCCATCCGATTACCACCTA (SEQ ID NO: 5)) second 
PCR. Primers in unc-49vrece oJL19 (5 '-GCGAAACGC ATACCAACTGTA (SEQ ID 
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NO: 6)) first PCR; and oJL20 (5'-TTCATGCCGAAAAGCAGGCGT (SEQ ID NO: 7)) 
second PCR. Primers in gpa-2 were the same as described in H. G. van Luenen, S. D. 
Colloms, R. H. Plasterk, Embo J. 12, 2513 (1993). PCR products were gel-purified and 
sequenced using oJL89 (SEQ ID NO: 5) as a primer, (positive positions on the graph), 
sequences that flank the left end of Mosl were deduced from unc-49 and gpa-2 sequences 
5 (negative positions on the graph). 

Figure 6 depicts germline mobilization of Mosl. (A) Mos transposase was 
expressed from an extrachromosomal array using either a glh-2 or a heat-shock promoter. 
The Mosl transposon was contained in an array integrated on chromosome V 
(oxIs25[Mosl ;rol-6(sd]). The array containing chromosome was balanced by the dpy- 

10 1 l(e224) mutation. In the next generation, catastrophic excision of the transgene was 
observed (indicated as Aoxls) among the progeny. (B) Comparison of excision and 
insertion frequencies using glh-2 and heat-shock (hsp) promoters to drive Mos 
transposase expression in the germline. New Mosl insertions were identified by PCR. 
Specifically, the presence of Mosl was detected through PCR by using two primers 

15 located in the transposon, oJL102 (SEQ ID NO: 1) and oJL103 (SEQ ID NO: 3). The 

absence of D. mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) 
and oJL104 (SEQ ID NO: 2) as described below. In addition, a PCR positive control was 
performed on each DNA sample using oligonucleotides located in the cha-1 gene. 
Recombination events were recognized as Dpy worms also containing Mosl flanked by 

20 original Drosophila genomic sequences. 

Figure 7 shows Mosl genomic insertions. (A) Southern blot probed with labelled 
Mosl DNA. Lane 1 to 8, strains in which insertions were detected by PCR; insertions 
derived from an extrachromosomal array and Mos transposase expressed under the heat- 
shock promoter. Mosl presence was assessed by PCR using two primers located in the 

25 transposon (oJL102 (SEQ ID NO: 1) and oJL103 (SEQ ID NO: 3)). The absence of £>. 
mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) and oJL104 
(SEQ ID NO: 2), while a PCR positive control was performed on each DNA sample using 
oligonucleotides located in the cha-1 gene. The control lane is Iin-15(n765) which had 
been used to build transgenic lines. Each lane contains 2 mg of Bgl II digested genomic 

30 DNA. The Mosl probe (encompassing bases 1 to 174 of the transposon) was synthesized 
by PCR using the pBluescribeM13+/Mosl plasmid as a template. (B) Distribution of 
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Mosl inserts on the physical map of the C. elegans genome. Black triangles: insertions 
from an extrachromosomal array. Open triangles: insertions from the integrated array 
oxIs25. Open circle: position of oxIs25, the integrated array of Mosl transposons. (C) 
DNA sequence of Mosl de novo insertions. Genomic fragments that flank the transposon 
left end were isolated by inverse PCR and sequenced. A primer was designed in the 
5 genomic region to the right of the insert and used with a Mosl specific primer to amplify 
and sequence the right end flanking the fragment. At insertion sites TA dinucleotides 
(bold) were duplicated during the process of transposon integration. Lower case: Mosl 
sequence. Upper case: genomic sequence. 

Figure 8 shows a knock-in strategy wherein Mosl excision causes a DNA double 
10 strand break, after which a transgene containing sequences homologous to the excision 
region pairs with the chromosome. Finally, the mutation contained in the transgene is 
copied into the chromosome. 

These drawings only provide information concerning typical embodiments of the 
invention and are not therefore to be considered limiting of its scope. 

15 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to novel methods for generating and identifying 
mutations in C. elegans, and includes methods for introducing a transgene into the C 
elegans germline. The present invention also includes methods for expressing transgenes 

20 in the C. elegans germline. The present invention also includes a transgene construct for 
expression in C. elegans and methods for generating mutations by regulating the 
transposition of endogenous or heterologous transposons in C elegans. The present 
invention also includes methods for inserting single copy DNA into a C elegans genome 
by introducing a transgene comprising FLP Recombination Target (FRT) sites into a C. 

25 elegans and causing recombination. FLP is a site-specific recombinase which efficiently 
catalyzes recombination between FRT sites that have been placed in the genome. K. G. 
Golic, S. L. Lindquist, Cell 44, 521 (1986). When FRT sites are in the same relative 
orientation within a chromosome, the FLP recombinase excises the intervening DNA 
from the chromosome. See Golic, Kent G., Generating Mosaics By Site-Specific 

30 Recombination, In Cellular Interactions In Development: A Practical Approach, 1 -3 1 (D. 
A. Hartley, ed., Oxford Univ. Press 1993); and Plasterk R.H., Groenen J.T., Targeted 
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Alterations of the Caenorhabditis elegans Genome by Transgene Instructed DNA Double 
Strand Break Repair Following Tel Excision, EMBOJ., 11:287-90 (1992). Other 
recombination systems, such as CRE/LOX, could also be used. 

All publications, patents, and patent applications cited herein are hereby 
incorporated by reference. U.S. Patent Application Serial No. 60/136,972 is hereby 
5 incorporated by reference in its entirety. 

DEFINITIONS 

The term "heterologous" is used herein to include nucleic acid sequences such as 
coding sequences and control sequences that are not normally joined together, and/or are 
not normally associated with a particular cell. Thus, a heterologous region of a construct 

10 or vector is a segment of nucleic acid within or attached to another nucleic acid molecule 
that is not found in association with this other molecule in nature. For example, a 
heterologous region of a nucleic acid construct could include a coding sequence flanked 
by sequences not found in association with the coding sequence in nature (e.g., synthetic 
sequences having codons different from the native gene). Similarly, a cell transformed 

15 with a construct which is not normally present in the cell would be considered 

heterologous for purposes of this invention. The term includes, but is not limited to, a 
DNA sequence from another organism. 

The term "transgene" is a heterologous sequence that is introduced into an 
organism. The term includes both sequences that integrate into one or more chromosomal 

20 locations of the organism and sequences that are maintained extrachromosomally, e.g., as 
episomes. 

The term "regulable expression control element" includes promoters, 
polyadenylation signals, transcription termination sequences, upstream regulatory 
domains, origins of replication, internal ribosome entry sites, enhancers, and the like, 

25 which provide for the replication, transcription, and translation of a coding sequence in a 
recipient cell or in a cell of an organism. The term promoter refers to a DNA sequence, 
that is capable of binding RNA polymerase and initiating transcription of a downstream 
(3' direction) sequence. Inducible promoters are promoters which are regulable. Such 
promoters may be regulated by, for example, temperature, small molecules, or 

30 developmental stages of an organism. 

Inducible promoters include heat-shock promoters, which are induced by exposure 

8 
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to heat. Inducible also promoters include small molecule-regulated promoters. Other 
inducible promoters include promoters that are induced (or repressed) by tetracycline and 
its derivatives (Gossen & Bujard, Proc. Natl Acad. Set USA 89:5547-5551 (1992)). 

"Operably linked" refers to an arrangement of elements in which the components 
so described are configured so as to perform their usual function. Thus, control sequences 
5 such as regulable expression control elements operably linked to a coding sequence are 
capable of affecting the expression of the coding sequence. The control sequences need 
not be contiguous with the coding sequence so long as they function to direct the 
expression thereof. Thus, for example, intervening untranslated yet transcribed sequences 
can be present between a promoter sequence and the coding sequence and the promoter 

10 sequence can still be considered "operably linked" to the coding sequence. 

For the purpose of describing the relative position of nucleotide sequences in a 
particular nucleic acid molecule throughout the instant application, such as when a 
particular nucleotide sequence is described as being situated "upstream," "downstream," 
"3'," or "5'," relative to another sequence, it is to be understood that it is the position of 

1 5 the sequences in the "sense" or "coding" strand of a DNA molecule that is being referred 
to as is conventional in the art. 

GENERAL METHODS 

In one embodiment, a method of regulated expression of a heterologous gene in 
cells of the germline of C. elegans comprises the steps of inserting a transgene construct 

20 into the C. elegans, wherein the construct comprises the heterologous gene operably 
linked to a promoter and a 3 s untranslated region of a gene that is expressed in the C. 
elegans germline; and expressing the heterologous gene. Other embodiments further 
comprise the use of a promoter which is inducible, such as a heat-shock promoter or a 
tetracycline-regulated promoter. Yet other preferred embodiments comprise the removal 

25 of substantially all bacterial plasmid sequences and repeated sequences from the 

transgene. In certain other especially preferred embodiments, the method of the present 
invention comprises the addition of the 3' untranslated region (UTR) of theg/A-2 gene, 
which is expressed in the C. elegans germline, to the 3' end of the transgene. A promoter, 
such as a glh-2 promoter, which is a germline specific promoter, may be used to drive 

30 expression of the transgene. 

The present invention further comprises transgene constructs for expression in C 
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elegans which comprises a heterologous gene operably linked to a promoter and a 3' 
untranslated region of a gene that is expressed in the C. elegans germline. In certain 
embodiments, the promoter is active in the cells of the germline of C elegans. In other 
embodiments, the promoter comprises an inducible promoter such as a heat-shock 
promoter or a tetracycline-regulated promoter expressed in the germline. Yet other 
5 embodiments comprise the removal of substantially all bacterial plasmid sequences and 
repeated sequences from the transgene. In certain preferred embodiments, the method of 
the present invention comprises the addition of a 3' untranslated region (UTR) of d.glh-2 
gene, which is expressed in the C. elegans germline, to the 3' end of the transgene. A 
promoter, such as ag/A-2 promoter, may be used to drive expression of the transgene. In 

1 0 still other embodiments, the heterologous gene codes for a transposase. In certain 
preferred embodiments, the heterologous gene is a TC3A transposase gene. 

The present invention also includes methods for generating mutations in the 
genome of a C. elegans by using controlled mobilization of transposons. In certain 
embodiments, the method for generating mutations in the genome of a C. elegans 

1 5 comprises the steps of introducing a transgene construct comprising a transposase gene 
which is operably linked to a regulable expression control element and a 3' untranslated 
region of a gene that is expressed in the C. elegans germline into a C. elegans; and 
expressing the transposase gene. Such methods allow the generation of mutants in which 
the mutated genes are tagged by the insertion of the transposon. PCR-based techniques 

20 permit fast identification of the transposon insertion that causes the mutation. Since the C 
elegans genome has been entirely sequenced, sequencing of the genomic regions that 
flank the transposon allows immediate identification of the mutated gene. 

In certain embodiments, transposons used in the method of the present invention 
are endogenous transposons. Several different types of endogenous transposons are 

25 present in C. elegans, and these can be mobilized in mutator strains. See, e.g., R. H. A. 
Plasterk, H. G. A. M. van Luenen, in C. elegans II, D. L. Riddle, T. Blumenthal, B. J. 
Meyer, J. R. Priess, Eds. (Cold Spring Harbor Laboratory Press, New York, 1997) pp. 97- 
1 16. Mutator alleles have been useful in cloning C elegans genes, particularly in early 
studies before the genome project reagents were widely available. In certain preferred 

30 embodiments, the endogenous transposons are Tc3 transposons. In yet other 
embodiments, the transposase gene is a TC3A transposase gene. In still other 

10 
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embodiments, the regulable expression control element is an inducible promoter, 
comprising in some embodiments a heat-shock promoter or a tetracycline-regulated 
promoter. Yet other embodiments comprise the removal of substantially all bacterial 
plasmid sequences and repeated sequences from the transgene construct. In certain 
preferred embodiments, the method of the present invention comprises the addition of the 
5 3' untranslated region (UTR) of the glh-2 gene, which is expressed in the C. elegans 
germline, to the 3' end of the transgene. In such embodiments, the regulable expression 
control element comprises a promoter, such as a promoter or a heat-shock promoter, 
which may be used to drive expression of the transgene. 

In other embodiments, mutants may be generated by using controlled mobilization 

10 of heterologous transposons. Using a heterologous transposon allows researchers to tag 
mutated genes with a sequence that is unique in C. elegans genome. These tagged 
mutations will allow the rapid cloning of the mutated genes. The primary advantage over 
the endogenous transposon scheme is that this method avoids the isolation of irrelevant 
insertions of the endogenous C elegans transposons. A further advantage is that the 

15 expression of the heterologous transposase would only mobilize the heterologous 
element, and thus mutations should only be due to insertions of these elements. 
Additionally, insertions could be stabilized by loss of the transposase-expressing 
construct. 

In certain preferred embodiments, Mosl, a mariner-like transposon isolated from 
20 Drosophila mauritiana, is used. M. Medhora et al., Genetics 1 28:3 1 1 -3 1 8 ( 1 991 ). See 
generally D. L. Haiti et al, Annu. Rev. Genet 31:337-358 (1997). Mosl is a member of 
the mariner/Tel family and was initially identified in the fruitfly Drosophila mauritiana. 
J. W. Jacobson, M. M. Medhora, D. L. Haiti, Proc. Natl Acad. Sci. USA83, 8684 
(1986). Like the other members of the mariner/Tel family, Mosl contains a single open 
25 reading frame which encodes the transposase. The transposase binds to and cleaves at the 
inverted terminal repeats (ITRs) present at each end of the transposon. See, e.g., D. L. 
Haiti, A. R. Lohe, E. R. Lozovskaya, Annu. Rev. Genet. 31, 337 (1997); R. H. Plasterk, Z. 
Izsvak, Z. Ivies, Trends Genet. 15, 326 (1999). The Mosl transposase is the only protein 
necessary for transposition in vitro. L. R. Tosi, S. M. Beverley, Nucleic Acids Res. 28, 
30 784 (2000). Because no additional factors are required for transposition, the Mosl 

transposon should be capable of transposition in heterologous species, and indeed the 
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transposon has been mobilized in species evolutionarily distant from Drosophila. F. J. 
Gueiros-Filho, S. M. Beverley, Science 276, 1716 (1997); J. M. Fadool, D. L. Hartl, J. E. 
Dowling, Proc. Natl Acad. ScL U SA 95, 5182 (1998); A. Sherman, et al., Nat 
Biotechnol 16, 1050 (1998); C. J. Coates, N. Jasinskiene, L. Miyashiro, A. A. James, 
Proc. Natl Acad. Set USA 95, 3748 (1998). 
5 In other embodiments, the transposase gene comprises restriction sites 5' of the 

start codon, restriction sites 5' of the stop codon, and an artificial intron in the transposase 
gene open reading frame. Other preferred embodiments involve a regulable expression 
control element which comprises an inducible promoter such as a heat-shock promoter or 
a tetracycline-regulated promoter. Yet other preferred embodiments comprise the removal 

10 of substantially all bacterial plasmid DNA sequences and repeated sequences from the 
transgene construct. In certain preferred embodiments, the method of the present 
invention comprises the addition of the 3' untranslated region (UTR) of the glh-2 gene, 
which is expressed in the C elegans germline, to the 3' end of the transgene. In such 
embodiments, the regulable expression control element may comprise a promoter, such as 

15 a glh-2 promoter, a myo- 3 promoter or a heat-shock promoter, which may be used to drive 
expression of the transgene. 

In yet other embodiments, the method of the present invention includes 
engineering the transposon to carry a heterologous DNA sequence into a C. elegans 
chromosome. Certain embodiments of the method of the current invention may comprise 

20 the steps of introducing a transposon into the C. elegans, wherein the transposon 

comprises the heterologous DNA sequence; introducing a transgene construct into the C. 
elegans, wherein the construct comprises a transposase gene which is operably linked to a 
promoter and a 3' untranslated region of a gene that is expressed in the C elegans 
germline; and expressing the transposase, such that the transposon integrates as a single 

25 copy into a C. elegans chromosome. 

In some embodiments, the transposon may be modified to contain bacterial 
plasmid DNA sequences. Such sequences may simplify cloning of mutated genes into 
bacteria from C. elegans genomic DNA preparations. In yet other embodiments, the 
transposon may carry a gene useful for selection or screening purposes. 

30 In certain preferred embodiments, FRT/FLP or CRE/LOX recombination sites 

could be inserted into the transposon. One of skill in the art would appreciate that an 
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engineered transposon carrying such recombination sites would facilitate insertion of 
single copy DNA into the G elegans genome. In other embodiments, the transposon 
could include polyadenylation sites or transcriptional terminators. 

In yet other preferred embodiments, the promoter is inducible. In such 
embodiments, inducible promoters such as a heat-shock promoter, may be used. Yet other 
5 preferred embodiments comprise the removal of substantially all bacterial plasmid DNA 
sequences and repeated sequences from the transgene construct. In certain preferred 
embodiments, the method of the present invention comprises the addition of the 3' 
untranslated region (UTR) of the glh-2 gene, which is expressed in the G elegans 
germline, to the 3' end of the transgene. In such embodiments, the regulable expression 
10 control element may comprise a promoter, such as a glh-2 promoter, a myo-3 promoter or 
a heat-shock promoter, which may be used to drive expression of the transgene. 

6. EXAMPLES 

The following examples are given to illustrate several embodiments which have 
15 been made within the scope of the present invention. It is to be understood that these 
examples are neither comprehensive nor exhaustive of the many types of embodiments 
which can be prepared in accordance with the present invention. 

Example 1 - Mobilization of Endogenous Tc3 Transposons 
About 15 copies of the Tc3 transposon are present in the genome of the wild-type 
20 G elegans N2 strain. These transposable elements are inactive in wild-type animals. Our 
goal is to cause specific mobilization of the endogenous Tc3 copies by expressing the 
TC3A transposase in the germline. New Tc3 insertions will be used as tags to clone the 
genes which they have disrupted. 

1- TC3A expressed under the ced-9 promoter causes somatic and germline hops: 
25 The TC3A transposase gene has been cloned behind a ced-9 promoter. This 

construct has been coinjected with linearized C. elegans genomic DNA and the lin-15(+) 
plasmid into a lin-15(-) strain and unstable transgenic strains have been obtained. 
Transposase activity was assayed by testing whether the construct could excise a Tc3 
element from the unc-22 gene and restore the function of the locus. The ced-9: :Tc3A 
30 arrays have been crossed into unc-22(r750::Tc3); Un-15(n765ts) background. Wild-type 
revertants have been recovered from Unc-22 Fl animals, suggesting functional expression 
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of the TC3A transposase. One of these extrachromosomal arrays was integrated into a 
chromosomal location to generate the insertion oxhl7[ced-9::Tc3A; lin-15(+)]. 

oxhl 7 was mapped on the X chromosome and functionally characterized. 
Un-15(n765ts);oxhl7[ced-9::Tc3A; lin-15(+)] males were crossed to unc-22(r750::Tc3); 
lin-15(n765ts) hermaphrodites. Heterozygous nonUnc nonLin hermaphrodites were 
5 cloned and allowed to self-fertilize. It was expected that among the progeny of these 

animals, there would be found 1/4 Unc animals homozygous for unc-22(r750::Tc3). Of 
those Unc individuals, 3/4 should be either homozygous or heterozygous for oxhl 7 i.e. 
nonLin. It was observed, however, that the Unc nonLin animals were greatly under- 
represented — instead, there were many more nonUnc individuals. Since the ced-9 gene is 

10 ubiquitously expressed, it was reasoned that TC3A could be present not only in the 
germline but also in somatic cells and could cause somatic reversion of the Unc 
phenotype. To test this hypothesis, nonUnc nonLin individuals were cloned, assuming 
that a fraction of them could be homozygous for unc-22(r750::Tc3) despite their wild- 
type phenotype. Self-progeny of these animals were scored. Individuals heterozygous for 

15 oxIsl7 segregated 1/4 Lin animals (which no longer expressed the TC3A transposase). In 
this category, plates were identified in which 100 % of the Lin worms were Unc while 
almost 100% of the nonLin were nonUnc. Hence, the parent hermaphrodite must have 
been homozygous unc-22(r750::Tc3) mutant although its phenotype was wild-type. 
These data demonstrate that ced-9::Tc3A causes somatic reversion of the unc- 

20 22(r750::Tc3) locus at high frequency. 

Rare nonUnc Lin were looked for among the Lin animals generated by self- 
fertilization of unc-22(r750::Tc3); lin-15(-); oxIsl7/+ hermaphrodites to determine 
germline reversion rates. Since the Lin worms had lost oxlsl 7 and had no TC3A 
transposase expressed somatically during development, the only way to revert the Unc 

25 phenotype was to receive one reverted copy of the unc-22 locus. This reversion event had 
to occur during germline development. Rare nonUnc Lin progeny were identified among 
Unc Lin progeny (experiment #1:1 nonUnc Lin in 61 Lin total; exp. #2: 2/106; exp. #3: 
4/203). It was concluded that ced-9::Tc3A causes an approximately 2% reversion rate of 
the unc-22(r750::Tc3) locus in the germline. 

30 2- Expression of TC3A using a g//?-2 promoter: 

Since the somatic reversion caused by ced-9: :Tc3A causes a discrepancy between 
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the phenotype and the genotype of an individual carrying a locus disrupted by a Tc3 
insertion, a TC3A expression vector was designed based on the germline specific glh-2 
gene (gift of Karen Bennett). A plasmid containing a glh-2 genomic fragment is able to 
rescue the Glh-2 mutant phenotype and is therefore likely to be expressed in the germline. 
The glh-2 open reading frame was deleted and replaced by a multiple cloning site to 
5 generate an expression cassette that retains glh-2 promoter and 3* untranslated regions. 
Tc3A was inserted to generate glh-2::Tc3A. This construct has been coinjected with 
linearized C elegans genomic DNA and the lin-15(+) plasmid into a lin-15(-) strain and 
several unstable transgenic strains have been obtained. A plasmid driving strong 
expression of the Green Fluorescent Protein (hereinafter "GFP") in the coelomocytes (gift 

10 of Piali Sengupta) has been incorporated also in this array. This allows monitoring of the 
presence of the array in a lin-15(+) background based on GFP expression. 

As described above, these arrays have been crossed into unc-22(r750::Tc3); 
Hn-15(n765ts) background. In contrast to the ced-9::Tc3A experiments, no somatic 
reversion events were observed. Germline reversion events were observed in the progeny 

15 of unc-22; oxEx[glh-2::Tc3A] hermaphrodites (experiment #1 : 2/1914 total scored 
animals; experiment #2: 5/4312). It was concluded that glh-2::Tc3A causes a 0.1% 
reversion rate of the unc-22(r750::Tc3) locus in the germline. 

Example 2 - Expression of the Mariner Transposase in Q elezans 
A mutagenesis strategy was also developed that uses the mariner transposon from 

20 the homfly (gift of David Lampe and Hugh Robertson). Mariner transposons from 

Drosophila are related to C. elegans Tc transposons. In fact, members of the Tc/mariner 
family of transposable elements have been identified in a broad range of species. R. H. A. 
Plasterk & H. G. A. M. van Luenen, Transposons, in C. elegans 7/97-1 16 (D.L. Riddle et 
al. eds., 1997). Horizontal transfer may be responsible for the broad distribution of this 

25 family of transposable elements. Horizontal transfer implies that specific host factors are 
not required for transposition and biochemical characterization has borne this supposition 
out. Purified transposase is able to catalyze the transposition of mariner or of Tc elements 
from a host plasmid to a target plasmid. D. J. Lampe et al., EMBO J 15:5470-5479 
(1996); J. C. Vos et al., Genes Dev 10:755-761 (1996). This has enabled researchers to 

30 mobilize mariner elements from Drosophila in other Dipteran species. T. G. Loukeris et 
al., Proc Natl Acad Sci USA 92:9485-9489 (1995); T. G. Loukeris et al., Science 
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270:2002-2005 (1995); A. R. Lohe & D. L. Hartl, Genetics 143:365-374 (1996). 
Recently, a mariner element from Drosophila has been mobilized in Leishmania, which 
represents a trans-kingdom transposition. F. J. Gueiros-Filho & S. M. Beverly, Science 
276:1716-1719 (1997). Thus, it was possible that mariner would be active in C. elegans 
as well. 

5 A plasmid encoding the mariner transposase HIMAR1 was received from David 

Lampe and Hugh Robertson. First, the transposase coding sequence was engineered to 
allow for efficient expression in C. elegans. Restriction sites were inserted immediately 
upstream to the start codon and just before the stop codon to facilitate subcloning of the 
fragment in various expression vectors. An artificial intron was inserted in the open 
10 reading frame since the presence of introns improves the expression level of transgenes in 
C. elegans. 

Engineered Himarl was placed under the control of the muscle specific promoter 
myo-3. The myo-3:: Himarl construct was injected with the lin-15(+) plasmid into a 
lin-15(-) strain and unstable transgenic strains obtained. Expression of the HIMAR1 

15 transposase was examined first by Western Blot. Extracts were prepared from oxExfmyo- 
3::Himarl; lin-15(+)] worms, run on a denaturing acrylamide gel and transferred to a 
nitrocellulose membrane. The membrane was probed with previously characterized 
antibodies that recognize the HIMAR1 protein (provided by David Lampe and Hugh 
Robertson). In extracts of transgenic worms, an approximately 42 kD protein which 

20 corresponds to the expected molecular weight of HIMAR1 was detected. The signal was 
absent from non-transgenic worm extracts. Using the same antibodies, the protein was 
visualized in situ using immunofluorescence on oxEx[myo-3::Himarl ; lin-15(+)] worms. 
Intense immunoreactivity was detected which was restricted to the nuclei of muscle cells. 
These data indicate that the HIMAR1 mariner transposase is expressed and properly 

25 targeted to nuclei in C. elegans cells. 

Example 3 - Germline Expression of the Transposase Using the glh-2 Promoter 
Generation of heritable Mosl insertions would require expression of the Mos 
transposase in the germline. However, expression of transgenes in the germline of C. 
elegans is not possible using standard techniques. Typically, transgenic worms are 

30 generated by injecting plasmid DNA into the gonads of C elegans (C. C. Mello, J. M. 
Kramer, D. Stinchcomb, V. Ambros, EmboJ. 10, 3959 (1991)). These fragments then 
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form a simple array of repeated DNA segments. Although gene expression is robust in 
somatic tissues, such simple arrays are not expressed in the germline or are silenced after 
a few generations. Co-injection of genomic DNA with plasmid DNA improves germline 
expression, presumably by preventing tandem repeats in the array. W. G. Kelly, S. Xu, M. 
K. Montgomery, A. Fire, Genetics 146, 227 (1997). To express the Mos transposase in the 
5 germline, an expression vector containing the promoter and the 3* UTR of the glh-2 gene 
was built. This gene encodes a germline helicase which is specifically expressed in the 
gonad. M. E. Gruidl, et al., Proc. Natl Acad. Set USA 93, 13837 (1996). Transgenic 
lines carrying extrachromosomal arrays of the glh-2: :Mos transposase construct were 
generated by microinjection. To maximize expression in the germline, constructs were 

10 isolated from plasmid vector sequences and were coinjected with fragmented genomic 
DNA. (The Mos transposase coding sequence was introduced between the promoter and 
the y UTR of glh-2. Specifically, this construct (pJL9) contains 2.2 kb of the glh-2 
genomic sequence immediately upstream of the translation start site (nt 29,882 to 32,095 
in cosmid C55B7), an Mlu I-Nhe I cloning site, and 0.8 kb of sequence immediately 

15 downstream of the glh-2 stop codon. An Mlu I-Nhe I fragment containing the Mos 

transposase was subcloned into pJL9 to generate the glh-2: :MosTransposase construct. 
Iin-15(n765) hermaphrodites were injected with a Spe I - Kpn I fragment of glh- 
2::MosTransposase (injection concentration 10 ng///l), with lin-15(+) (EKL15) and ofin- 
l::gfp (pPD97/98) fragments and N2 worm genomic DNA as described above for the 

20 generation of the oxExl66[hsp::MosTransposase] array.) 

Transposase expression in the germline was determined by assaying for excision 
of transposons from a defined chromosomal location. Specifically, the Mw/-containing 
extrachromosomal array was integrated into chromosome V to generate oxIs25[Mosl ;rol- 
6(sd)]. The oxIs25 array was mapped less than 0.54 map units from dpy-1 1 . 

25 Heterozygous oxIsl2/dpy-l 1 worms were generated. These animals largely segregated 
Dpy and Rol progeny as expected for these closely linked markers (Figure 6). However, 
when the glh-2: :Mos Transposase transgene was crossed in, approximately 16% of the 
nonDpy progeny were nonRol (15.7 % ± 0.9, mean ± SEM, n=44 plates). The nonRol 
phenotype was stably inherited. It was hypothesized that in those worms, the Mos 

30 transposase excised Mos 1 from the integrated array. The resulting DNA breaks were 
responsible for catastrophic excision of the entire locus, including interspersed rol-6 
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copies. The correlated loss of rol-6(sd) and Mos 1 was confirmed by PCR, in which 
NonRol individuals were cloned, selfed and DNA was purified from the progeny. Mosl 
containing fragments were detected by PCR using one primer complementary to Mosl 
(OJL102: 5'- CAACCTTGACTGTCGAACCACCATAG (SEQ ID NO: 1)) and one 
primer complementary to D. mauritiana flanking DNA (oJL104: 5- 
5 ACAAAGAGCG AACGCAGACGAGT (SEQ ID NO: 2)). Of 1 88 nonRol worms, only 
one individual retained a copy of the Mos 1 fragment that was initially present in the 
transgene. Based on the phenotypic reversion of the Rol phenotype, it was calculated that 
1 in 5 chromosomes experienced catastrophic excision of the transgene (20.9 % ± 1.1 %, 
mean ± SEM, n=44 plates) The probability p of a single chromosome containing the array 

10 of Mosl elements transgene experiencing "catastrophic excision" can be derived from a 
Punnett square where the ratio R of nonRol worms over the total number of the progeny: 
R = l/4p + l/4p + (l/2p) 2 . These results demonstrated that the #//f-2-based expression 
vector expressed the transposase in the germline and that the Mos 1 transposon in the 
chromosome was recognized as a substrate. 

15 To determine if excision of Mos 1 from the array was associated with insertion in 

the genome, the progeny of animals expressing the transposase in the germline were 
screened for de novo insertions. Specifically, using PCR, the presence of the Mos 1 
element in the absence of the Drosophila sequences which flank the transposon in the 
array was assayed. Mosl presence was assessed by PCR using two primers located in the 

20 transposon (oJL102 (SEQ ID NO: 1) and oJL103: 5'- 

TCTGCGAGTTGTTTTTGCGTTTGAG (SEQ ID NO: 3)). The absence of D. 
mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) and oJL104 
(SEQ ID NO: 2) as described above. In addition, a PCR positive control was performed 
on each DNA sample using oligonucleotides located in the cha-1 gene. Because the 

25 integrated array containing unmobilized transposons also contained rol-6(sd), insertions 
were sought in nonRol progeny; specifically, either nonRol animals that experienced 
catastrophic excision of the array or Dpy progeny (Figure 6B) were analyzed. Insertions 
were identified in 1% of nonRol progeny (2/227) and in 10 % of Dpy progeny (11/116 
F1+F2 Dpys). These results demonstrated that transposition of Mos 1 could be achieved 

30 in the C. elegans germline. However, it was observed that high rates of excision were not 
accompanied by high rates of insertion; these results support previous data indicating that 
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these two processes are not coupled. H. G. van Luenen, S. D. Colloms, R. H. Plasterk, 
Celll% 293 (1994). 

Using integrated arrays as a source of transposons prevents the easy recovery of 
new insertions that occur on the same chromosome; this bias could be circumvented by 
using an extrachromosomal array of transposons. In addition, extrachromosomal arrays 
5 are not completely stable in meiosis, which makes the isolation of strains lacking 

immobilized transposons easy after mobilization. Therefore, it was tested whether Mos 1 
could be mobilized from an extrachromosomal array into the chromosomes. Specifically, 
the glh-2::Mos Transposase construct was used to mobilize transposons from a Mos l- 
bearing array (oxExl64[Mosl; rol-6(sd)]). The nonRol progeny from double transgenic 
10 animals (oxExl67[glh-2::MosTransposase]; oxExl64[Mosl ; rol-6(sd)]) were analyzed 
for transposition events using PGR. An insertion frequency of 1% (3 insertions/ 302 
progeny, Table 1) was detected. Thus, these results closely match those obtained for 
integrated arrays. 

Table 1. Frequencies of Mosl genomic insertions from an extrachromosomal array. 

15 nonRol progeny of oxExl64[Mosl; rol-6(sd)]; oxExfMosTransposaseJ were analyzed by 
PCR for the presence of Mosl and the loss of the Drosophila flanking sequences present 
in the donor plasmid. Mosl presence was assessed by PCR using two primers located in 
the transposon (oJL102 (SEQ ID NO: 1) and oJL103: 5'- 
TCTGCGAGTTGTTTTTGCGTTTGAG (SEQ ID NO: 3)). The absence of D. 

20 mauritiana flanking sequence was checked using oJL102 (SEQ ID NO: 1) and oJL104 
(SEQ ID NO: 2) as described above. In addition, a PCR positive control was performed 
on each DNA sample using oligonucleotides located in the cha~l gene. When heat- 
shocked (1 hour at 35°C) POs were moved to fresh plates and eggs were collected for the 
next 24 hours. *During experiment #5 the stability of the oxEx 164 [Mosl ; rol-6(sd)] 

25 transgene reached 75 % while in previous experiments it was approximately 20 %. 



Transposase 
construct 


Transposition frequency 




no heat-shock 


with heat-shock 


glh-2: MosTransposase 


exp #1:2/108 1.9% 
exp#2:0/104 0% 
exp #3: 1/90 1.1% 


exp#l:ND 
exp #2: ND 
exp #3: 0/65 0% 
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hsp:: MosTransposa.se 


exp ffl: JNJJ 


exp ffi . h/oj 


O.Z /O 




exp #2: 0/33 


exp #2: 3/98 


3.1 % 




exp #3: 0/39 


exp #3: 6/87 


6.8 % 




exp #4: ND 


exp #4: 5/85 


5.9% 




exp #5: 0/44* 


exp #5: 15/34* 


44.1 % 



Example 4 - Mos 1 - Mobilization in the Germline Using a Heat-Shock Promoter 

Theglh-2 promoter expresses the transposase in the germline constitutively. 
Constitutive expression of the transposase has two disadvantages. First, crosses must be 
5 set up fresh every generation to guarantee that the array remains intact and does not 
accumulate inherited changes. Second, because the tranposase was expressed in the 
germline early in development, events identified in the progeny might not be independent 
but might have occurred when the germline was still comprised of only few cells. 
Expression limited to adults can be achieved by using a heat-shock promoter. Expression 

10 of the transposase could be induced after a strain containing the transposase and 

transposons had been propagated and expanded to many animals. In addition, heat- 
shocking animals with mature germlines would maximize the independence of insertion 
events. Animals expressing the transposase under the control of the heatshock promoter 
and bearing the integrated transposon (oxIs25/dpy-l 1 ; oxExl66[hsp::Mos Transposase]) 

15 were heat-shocked. POs could only be heat-shocked for 45 minutes; such animals were 
almost paralyzed, stopped eating and had low brood sizes. Longer heatshock caused the 
animals to die. It was speculated that this lethality is due to high rates of transposition in 
somatic cells. Ubiquitous expression of the transposase would cause double strand breaks 
in the chromosome at the site of integration in every cell which may cause cell cycle 

20 arrest or apoptosis. G. Evan, T. Littlewood, Science 281, 1317 (1998). 

Fl progeny were analyzed for catastrophic excision, that is, for the appearance of 
nonRol nonDpy progeny (Figure 6A). In contrast with results obtained using the glh-2 
expression vector, catastrophic excision was not observed. Only rare nonRol progeny 
were generated which were likely to be the result of recombination between the array and 

25 the dpy marker (Figure 6B). However, by analyzing the Dpy progeny of heat-shocked 
animals, it was discovered that the heatshock construct caused the efficient insertion of 
Mos into new locations in the genome. Approximately 27% (25/94) of Fl or F2 Dpy 
worms carried novel transposon insertions (Figure 6B). In addition to novel insertions, 
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recombinant chromosomes containing both the dpy marker and the original array arose at 
high frequency. These were identified as Dpy animals containing transposons flanked by 
Drosophila DNA. This hotspot for recombination is likely to arise as a result of double 
strand breaks introduced into the array by the transposase. A. R. Lohe, C. Timmons, I. 
Beerman, E. R. Lozovskaya, D. L. Hartl, Genetics 154, 647 (2000). 
5 It was then tested whether transposition could occur from an extrachromosomal 

array using the heatshock promoter construct to express the transposase (oxExl66; 
oxExl64[Mosl; rol-6(sd)]). Hermaphrodites bearing both arrays were heat-shocked as 
young adults. The nonRol progeny were analyzed by PCR for transposition events. New 
insertions were observed in 8.9% of the Fl (33/369 progeny, Table 1). Since transposition 

10 could have occurred into the transposase containing array, F2 animals that lost this array 
were isolated from eight transposon-bearing strains. Genomic DNA was prepared from 
these strains and analyzed for the presence of MosL The transposon was still detected in 
all 8 strains, thus demonstrating that the transposon had not inserted in the array. No 
insertions could be detected in 1 16 Fl clones derived from non-heat-shocked parents. The 

1 5 frequency of transposition was low but one of the main limiting factors is the stability of 
the extrachromosomal array that is used as a transposon source. The initial experiments 
were performed when the array was only 20% stable and transposition frequencies were 
in the range of 5%; when the array matured and was about 75 % stable, transposition 
frequency reached 44%. Generating more stable extrachromosomal arrays could increase 

20 the frequencies of transposition. 

The heatshock promoter was able to drive expression of Mos transposase in the 
germline and to promote transposition events at a higher rate than obtained using the 
glh-2 construct. Temperature has been shown to affect transposition frequency in other 
organisms. D. Garza, M. Medhora, A. Koga, D. L. Hartl, Genetics 128, 303 (1991). One 

25 possible explanation for the efficient transposition observed after heatshock is that 

chromatin structure is somehow altered by the heatshock. Therefore, it was tested whether 
heatshock itself could account for the difference in transposition frequencies. Parents with 
extrachromosomal arrays carrying the g/A-2-transposase construct and carrying the 
transposon were heatshocked and progeny were tested for transposition. The frequency of 

30 transposition was not improved by the heatshock treatment (Table 1). Thus, heatshock 
itself does not facilitate efficient transposition. 
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Example 5 - Transposition in Somatic Cells 

To determine whether the Mosl element could be mobilized in C. elegans cells, 
Mosl transposition in somatic cells was first analyzed. The gene encoding the Mosl 
transposase was engineered to improve expression in the worm and placed under the 
control of a heat-shock promoter. The Mos transposase encoding sequence was PCR 
5 amplified out of pBluescribe Wl\2+IMosl (M. Medhora, K. Maruyama, D. L. Hartl, 

Genetics 128, 311 (1991), modified as described in Figure 5A and subcloned as a Mlu I- 
Nhe I fragment between the hsp- 16-48 promoter (H. G. van Luenen, S. D. Colloms, R. H. 
Plasterk, Embo J. 12, 2513 (1993)); E. P. Candido, et al., Genome 31, 690 (1989) and the 
glh-2 3' untranslated region (fragment 35383 to 36190 in cosmid C55B7) (M. E. Gruidl, 

10 et al., Proc. Natl Acad. Set USA 93, 13837 (1996) (Figure 5A). The resulting construct 
was used to generate the extrachromosomal array oxExl66[hsp::MosTransposase]. (lin- 
15(n765) hermaphrodites were injected in the syncitial gonad with a mixture of the 
following gel-purified fragments: a Hind III-Eco RI fragment of hsp:: Mos Transposase 
(injection concentration: 10 ng/^1), a Pst I-Bsi WI fragment of the ofm-l ::gfp construct 

15 (pPD97/98) that expresses GFP in the coelomocytes (gift of Piali Sengupta) (injection 
concentration: 5 ng//il) and a Kpn I-Eag I fragment of EKL15(//w-/5 +) (S. G. Clark, X. 
Lu, H. R. Horvitz, Genetics 137, 987 (1994)) (injection concentration: 10 ng/^1). Plasmid 
backbones were removed from all purified fragments. Eco RV-digested N2 worm 
genomic DNA was coinjected at a concentration of 70 ng/^1. Another extrachromosomal 

20 array, oxExl64[Mosl; rol-6(sd)] t contained the Mosl transposon. (Iin-15(n765) 

hermaphrodites were injected with a 2.2 kb Xho I-Hind III fragment of pBluescribe 
M\3+/Mosl that contains the 1.3 kb Mosl element flanked by D. simulans sequences (M. 
Medhora, K. Maruyama, D. L. Hartl, Genetics 128, 311 (1991) (injection concentration: 
10 ng/^1) and a 2.2 kb rol-6(sd) fragment of pRF4 (J. M. Kramer, R. P. French, E. C. 

25 Park, J. J. Johnson, Mol Cell Biol 10, 2081 (1990) (injection concentration: 10 ng//il). 
Eco RV-digested N2 worm genomic DNA was coinjected at a concentration of 80 ng//^l 
to increase the complexity of the array; this array also contained the dominant genetic 
marker rol-6(sd) which causes animals to roll instead of swimming in a sinusoidal 
fashion. These two strains were crossed and progeny carrying both arrays were heat- 

30 shocked as young adults. After 12 hours, the heat-shocked animals were harvested and 
genomic DNA was prepared. Mosl transposition was detected using the strategy 
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developed by van Luenen et al See H. G. van Luenen, S. D. Colloms, R. H. Plasterk, 
Embo 1 12, 2513 (1993). Specifically, insertions were identified by PCR amplification 
using one set of primers complementary to the transposon and another set complementary 
to an arbitrary target gene. DNA purification and PCR were performed as described in H. 
G. van Luenen, S. D. Colloms, R. H. Plasterk, Embo J. 12, 2513 (1993). The primers in 
5 Mosl were 0JL88 (5'-CGCATGCGGCTTACTCAC (SEQ ID NO: 4)) first PCR; and 
0JL89 (5'-GGCCCCATCCGATTACCACCTA (SEQ ID NO: 5)) second PCR. Primers 
in unc-49 were oJL19 (5 '-GCGAAACGCATACCAACTGTA (SEQ ID NO: 6)) first 
PCR; and oJL20 (5 '-TTC ATGCCG AAAAGC AGGCGT (SEQ ID NO: 7)) second PCR. 
Primers in gpa-2 were the same as described in H. G. van Luenen, S. D. Colloms, R. H. 

10 Plasterk, Embo J. 12, 2513 (1993). PCR products were gel-purified and sequenced using 
oJL89 (SEQ ID NO: 5) as a primer. A PCR product can be obtained only if a transposon 
has integrated into the target gene. The method is sensitive enough to detect a single 
insertion in the target gene in a single somatic cell of an adult animal. Insertions in two 
genes were assayed: the gpa-2 gene which encodes a G protein subunit (R. R. Zwaal, J. E. 

15 Mendel, P. W. Sternberg, R. H. Plasterk, Genetics 145, 715 (1997)), and the unc-49 gene 
which encodes a GABA receptor. B. A. Bamber, A. A. Beg, R. E. Twyman, E. M. 
Jorgensen, 1 NeuroscL 19, 5348 (1999). Mosl insertions were detected in both genes (2.5 
±1.0 inserts in 10 ng of genomic DNA, mean ± S.D., n=5 experiments). Given that the 
maximal distance of the inserts from our gene primers was approximately Ikb, it was 

20 estimated that an average of 1 0 insertions occurred per cell in heat-shocked animals. 
Insertions were also detected at low frequency in worms that contained the transposon 
array but lacked the transposase expression construct (0.09 insertions in 10 ng DNA, n=2 
experiments). These data indicated that low levels of Mosl tranposase were expressed 
from the intact Mosl transposons in the extrachromosomal array. 

25 To demonstrate that these transposon insertions represented bona fide 

transposition events, PCR products were gel-purified and the sequence of the insertion 
sites from the somatic transposition assays was determined. In all cases, the Mosl 
inverted terminal repeats were complete, the Drosophila sequences that flanked Mosl in 
the donor plasmid were no longer present, and the insertions all took place at a TA 

30 dinucleotide. Transposon insertions were distributed uniformly in exons, introns and 3' 
noncoding sequences of gpa-2 and unc-49 (Figure 5B). Comparison of 22 insertion sites 
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did not reveal a strong consensus site apart from a bias toward a T at position +1 with 
respect to the TA dinucleotide (Figure 5C). These data demonstrated that Mosl can hop 
into C. elegans chromosomes and that the transposase was sufficient to catalyze insertion 
without Drosophila host factors. 

Example 6 - Introduction of Mariner Transooson Copies into th e C eleeans Genome 
5 The full-length copy of the hornfly mariner transposon Autmar was gel-purified to 

remove non-nematode plasmid sequences. Purified Autmar was injected with linearized 
C. elegans genomic DNA and the rol-6(dm) plasmid into Un-15(n765ts) worms and 
unstable transgenic strains were recovered. Due to the presence of rol-6(dm) in the array, 
transgenic animals roll instead of displaying normal sinusoidal locomotory movements. 
10 These animals are Lin when grown at the nonpermissive temperature because they are 

genotypically lin-15(~). This array was integrated into a chromosomal location to generate 
the oxIs21 insertion. oxIs21 was mapped to chromosome X, 2.5 m.u. away from the lon-2 
locus. 

Example 7 -The Mariner Transposase Can Excise Mariner Transposons from C. 
15 Elegans Chromosomes in the Germline 

Engineered HimarJ was inserted in the glh-2 germline expression cassette 
described above. The glh-2: :Himarl construct was co-injected with linearized C elegans 
genomic DNA and the lin-15(+) plasmid into lin-15(n765ts) worms. The oxExllS 
extrachromosomal array is transmitted at each generation to a large fraction of the 
20 progeny. 

Iin-15(n765ts); oxExll9[glh-2::Himarl;lin-15(+)] males were crossed into 
lin-15(n765ts) oxIs21 [Autmar; rol-6(sd)] hermaphrodites. As predicted, animals of the 
cross-progeny were Rol nonLin. At the next generation, it was expected that 1/3 of the 
Rol animals would be found to be homozygous for oxIs2L However, among 48 Rol 

25 nonLin cloned individuals, none segregated more than approximately 75% Rols, while 6 
of 15 Rol Lin hermaphrodites segregated 100% Rol progeny. After careful 
characterization of the progeny of parent animals exhibiting various phenotypes, it was 
concluded that oxExllS could elicit the reversion of the Rol phenotype. Presumably, the 
reversion is caused by excision of the Autmar transposons from the integrated array which 

30 in turn leads to loss of the adjacent roU6(dm) genes by imprecise repair of the locus. It 
was concluded that the mariner transposase can excise mariner transposons from C. 
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elegans chromosomes in the germline. 
Example 8 - Mobilization of a Heterolo2ous Mariner Transposon in the C elegans 

Genome 

Materials and Methods: 
Reagents 

Mosl containing strain: 

The transgenic strain EG1638 that contains Mosl has been generated by 
coinjection of lin-15(n765) worms with: 

- the 2.2 kB Xho 1-Hind III fragment of pBluescribe M 13+/Mosl, M. Medhora et 
al., Genetics 128:311-318 (1991); (injection concentration: 10 ng/^1) 

- the 2.2 kB Hind III rol-6 rescuing fragment containing the semi-dominant 
mutation roU6(sulO06), J. M. Kramer et al., Mol Cell Biol 10:2081-2089 (1990); 
(injection concentration: 10 ng//zl) 

- EcoRV digested genomic DNA prepared from N2 worms (injection concen- 
tration: 80 ng/fA) 

The resulting strain Iin-I5(ts); oxExl64[Mosl; rol-6(sd)] exhibits a Rol Muv 

phenotype when grown above 20°C. The Muv phenotype is not expressed when worms 

are grown at 1 5°C. 

Mos 1 transposase expressing strain: 

As shown in Figure 2, the expression vector pJL44 (HSP::MosTase::glh-2) 

contains the following elements: 

• a 377 bp Hspl6-48 heat-shock promoter fragment recovered by PCR from 
pRP176, H. G. van Luenen et al., EMBO J 12:2513-2520 (1993), using the oligos 
oJL21 5 '-CGAAGCTTGCTGGACGGAAATAGTGG (SEQIDNO: 19) and 
oJL22 5 '-CGACGCGTTCTTGAAGTTTAGAGAAT (SEQ ID NO: 20). 

- a 1088 bp fragment containing the Mosl transposase coding sequence amplified 
by PCR from pBluescribe M 13+/Mosl using oJL77 5'- 

GCACGCGTTATGTCGAGTTTCGTGCCGAATAAAG (SEQ ID NO: 21) and 
oJL78 5'- 

GCGCTAGCTATTCAAAGTATTTGCCGTCGCTCGCGACACATTTTTCCCA 
(SEQ ID NO: 22). An artificial intron (5'- 
GTAAGTTTAAACATATATACTAACTAACCCATGGATTA- 
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TTT AAATTTTC AG-3 1 (SEQ ID NO: 18) was inserted at position 264 with 
respect to the ATG. 

- a 300 bp fragment containing the glh-2 3TJTR (nt 3287 to 4087 with respect to 
glh-2 start codon) recovered form a 6.3 kb glh-2 genomic fragment subcloned in 
pBluescript KS (Stratagene) (M. E. Gruidl et al t Proc Natl Acad Sci USA 

5 93:13837-13842 (1996); gift of Karen Bennett, University of Missouri). 

The transgenic strain EG 1643 that contains the Mosl transposase expression 
vector has been generated by coinjection of Un-15(n765) worms with: 

- the Hind III-EcoRl fragment of pJL44 (injection concentration: IOng/^1) 

- the Eag I-Kpn I /£w-15 genomic rescuing fragment from EKL15, S. L. Mclntire 
10 et al., Nature 389:870-876 (1997) 

- the Pst I-BsiW I fragment of pPD97/98 that drives expression of the Green 
Fluorescent Protein in the coelomocytes (gift of Piali Sengupta, Brandeis 
University) (injection concentration: 10 ng//il) 

- EcoRV digested genomic DNA prepared from N2 worms (injection 
15 concentration: 80 ng/^1) 

The resulting strain Iin-I5(ts); oxExl66 [hsp::MosTase:. glh-2 ; pPD97 198; lin- 
15(+)] has a wild-type phenotype. The presence of the extrachromosomal array causes 
expression of GFP in the coelomocytes which can be visualized using fluorescence 
microscopy. 

20 Mobilization of the transposon in the C elegans genome 

Mobilization of Mosl was achieved by crossing the transposase-expressing strain 
into worms containing the Mos 1 transposon-containing array. Iin-I5(ts)\ oxExl66[hsp:: 
MosTase:. -glh-2; pPD97198; lin-15(+)J hermaphrodites were crossed with N2 males at 
25 °C. Non-Muv males lin-15(ts); oxExl66 were crossed with Iin-I5(ts); oxExl64[Mosl; 

25 rol-6(sd)] Rol non-Muv hermaphrodites previously grown at 1 5°C. 

The cross was kept at 20°C. Late L4 larvae or young adult Rol worms were 
transferred to a fresh plate and heat-shocked for 1 hour at 35 °C. After 6 hours, non-Muv 
Rol P0 animals (Hn-lS(ts); oxExl64; oxExl66) were transferred to a fresh plate and 
allowed to lay eggs for 48 hours. A fraction of the Fl animals contain insertions of Mos 1 

30 in their genome and can be screened for mutant phenotypes. 
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Identification of transposon insertion sites 

Mosl insertion were identified by inverse PCR, as shown in Figure 3. Genomic 
DNA was prepared according to standard procedure. Approximately 100 ng of genomic 
DNA was digested by Sau3A in a 10 \x\ volume for 3 to 14 hours. The restriction enzyme 
was inactivated by heating for 20 minutes at 70°C. Fragments were circularized by self- 
5 ligation (overnight incubation at 1 5 °C with 5 units of T4 DNA ligase). 

3 iA of ligated DNA was used for PCR amplification. A first round of 
amplification was performed using the primers oJL103 5'- 
TCTGCGAGTTGTTTTTGCGTTTGAG (SEQ ID NO: 3) and oJLl 14 5'- 
AAAGATTCAGAAGGTCGGTAGATGGG (SEQ ID NO: 10) (30 cycles, 45" at 94°C / 

10 V at 60°C / 1'15" at 72°C, magnesium chloride concentration: 1.5 mM). The product of 
the first amplification was diluted 100-fold and subjected to a second round of 
amplification using the nested primers oJLl 15 5 '-GCTC AATTCGCGCCAAACTATG 
(SEQ ID NO: 1 1) and oJLl 16 5 '-GAACGAGAGGC AGATGGAG AGG (SEQ ID NO: 
12) (25 cycles, 45" at 94°C / 1' at 62°C / ri5" at 72°C, magnesium chloride 

15 concentration: 2.5 mM). Resulting fragments were run on an agarose gel, gel-purified and 
sequenced either directly or after subcloning. 

Figure 4 contains the sequence of an inverse PCR product demonstrating insertion 
of Mosl in chromosome X. Nucleotides in capital letters are from the Mosl transposon. 
C. elegans flanking genomic region is in lower case. It matches the Y47C4.Contig215 

20 sequence from chromosome X available at the Sanger Centre. 

Mosl, a mariner-like transposon isolated from Drosophila mauritiana was used. 
Transgenic worms containing Mosl in an extrachromosomal array were crossed with 
transgenic worms containing an expression vector in which a heat-shock promoter (hsp 
16-48) drives the expression of the mos transposase (Figure 1). Cross-progeny containing 

25 both the Mosl transposon and the mos transposase were isolated. Heat-shock of these 

worms induced the expression of the transposase which in turn caused Mosl elements to 
transpose from the extrachromosomal array into the C. elegans genome. Five insertions 
were isolated, for a rate of one in seventeen animals analyzed. However, this array is only 
20% stable per generation. Thus, there in on average one transposition into chromosomes 

30 for every three germ cells exposed to the transposon. 

Some insertions will disrupt genes and cause mutant phenotypes. Mutant worms 
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are outcrossed with wild-type worms containing no Mosl transposon. Since the insertion 
responsible for the mutation cosegregates with the mutant phenotype, it is possible to 
isolate the single relevant Mosl insertion after only a few outcrosses. Genomic DNA is 
then prepared from the outcrossed mutant. Regions flanking the transposon are recovered 
by inverse PCR and sequenced. Comparison of flanking sequences with the C. elegans 
5 genome sequence allows immediate identification of the mutated gene. This new 

mutagenesis system will significantly speed up the identification of genes of interest 
using C. elegans as a genetic model. 

Example 9 - Mosl Mutagenesis and Rapid Cloning of Genes 
In one embodiment, the method described in this invention is capable of 

1 0 generating mutations which can be rapidly cloned based on the Mosl unique DNA tag. 
To demonstrate that this is true, mutants have been identified and the relevant genes have 
been cloned using inverse PCR. Specifically, a morphological mutant in C. elegans was 
isolated which causes the worms to be short and squat. Such mutations are called dumpy 
mutations and are given the three letter designation "dpy". A dumpy animal was 

15 identified after mobilization of the wild-type Mosl transposon. DNA was prepared, 

cleaved with the restriction enzyme Sau3A, and religated. Inverse PCR was performed 
using primers contained within the transposon but facing outward. The amplified 
fragment was sequenced. The Mosl element was inserted 175 nucleotides 5' of F54D8.1, 
which encodes a collagen protein. An inspection of the genetic map demonstrated that 

20 this insertion is in a chromosomal interval which also contains the dpy-1 7 gene which had 
been previously defined by point mutations using chemical mutagens by Sydney Brenner 
in 1974. A complementation test was performed and the test demonstrated that this 
mutation was an allele of dpy-1 7. Thus, the method is capable of rapidly demonstrating 
the molecular identity of a gene which had remained unknown for almost 30 years. 

25 Mutants incapable of detecting high osmotic gradients (Osm) were also screened for. The 
first Osm mutant identified was cloned in a similar manner and proved to be an insertion 
of Mosl in exon 10 of the eau4 gene. 

Example 10 - Targets of Transposase and Transposon 
For Mosl insertions to be useful for the cloning of mutated genes, the transposase 

30 must specifically mobilize Mosl and not other mariner elements. The C. elegans genome 
contains endogenous transposons. Apart from the most active Tel and Tc3 transposons, 
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which are distantly related to Mosl, every haploid genome contains at least 55 copies of a 
Mariner Like Element (MLE), which is closely related to Mosl. M. M. Sedensky, S. J. 
Hudson, B. Everson, P. G. Morgan, Nucleic Acids Res. 22, 1719 (1994); H. M. 
Robertson, D. J. Lampe, Mol Biol Evol 12, 850 (1995). Since in a few cases 
transposases of the Mariner family have been shown to cross-mobilize distinct but related 
5 transposons (P. Sundararajan, P. W. Atkinson, D. A. O'Brochta, Insect Mol Biol 8, 359 
(1999)), it was tested whether Mos transposase expression had triggered transposition of 
the endogenous MLEs. Eight strains in which Mosl insertions had occurred were 
analyzed by Southern blot for changes in MLE distribution. No changes in MLE 
distribution were detected. Worm genomic DNA of lin- 15 (n7 65) and Mosl -containing 

10 strains was extracted, Bgl II digested and run for Southern blot analysis using standard 
procedures. Oligos oJL132: 5'-ATATGCGGTGCGATGGGTGAG (SEQ ID NO: 8) and 
oJL133: 5 '-GGC G AACGCG ATG AGAAG AAAG (SEQ ID NO: 9) were used to amplify 
a 842 bp MLE fragment from N2 worm genomic DNA. The PGR product was sequenced 
and used for probe synthesis, (data not shown) indicating that Mos transposase is specific 

1 5 for Mosl in the C. elegans germline. 

How many insertions occurred in every animal and what were their distributions? 
The number of chromosomal insertions per strain was determined by Southern blot 
analysis in eight insertion strains. Only one insertion per strain was detected (Figure 7A). 
To determine the location of the mobilized transposons, the left junctions of 17 insertions 

20 were cloned using inverse PCR. Approximately 1 00 ng of total genomic DNA was 

digested with Sau3A, self-ligated under dilute conditions, and then 3 % of the ligation 
was subjected to two rounds of nested PCR using the following primers: oJL103 (SEQ ID 
NO: 3)/oJLl 14 5'-AAAGATTCAGAAGGTCGGTAGATGGG (SEQ ID NO: 10) (first 
PCR), oJLl 15 5'-GCTCAATTCGCGCCAAACTATG (SEQ ID NO: 1 1) / oJLl 16 5'- 

25 GAACGAGAGGCAGATGGAGAGG (SEQ ID NO: 1 2) (second PCR). PCR products 
were purified on agarose gel and sequenced using oJLl 15 (SEQ ID NO: 1 1) as a primer. 
In agreement with the Southern blot experiments, only one insertion per strain was 
detected. Insertion sites were distributed on all six chromosomes (Figure 7B). 
Transposition occurred into exons, introns and intergenic regions (Table 2). Sequences 

30 flanking both sides of the transposon were determined for nine of the localized insertions. 
In each case, the inverted terminal repeats were complete and flanked by a TA 
dinucleotide that arose from the duplication of the original TA found in the genomic 
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Table 2. Properties of Mosl genomic insertions. Mosl flanking were compared with the 
C. elegans genome sequence. Physical location in the genome is given as the nucleotide 
position of the corresponding clone in the C. elegans database (ACeDB). 

5 



Isolation 
name 


Transposon source 


Physical location 


Interpolated 
genetic location 


Genefinder 
predictions 


oxTil 


Extrachromosomal 


Y65B4BL (2)27,362 


LGI, - 19 


Intergenic 


oxTi2 


Extrachromosomal 


Y44E3A (2} 34,440 


LGI, - 4.75 


Intergenic 


oxTi3 


Extrachromosomal 


M01E5 @19,740 


LGI, + 29.9 


Intergenic 


oxTi4 


Extrachromosomal 


T13C2 (a) 4 948 


LGII + 0 1 


Exon #4 of 
F41G.12 


oxTiS 


Extrachromosomal 


K08E5® 31,631 


LGIII, +4.61 


Intergenic 


oxTi6 


Extrachromosomal 


H23L24 @ 4,529 


LGIV, + 3.9 


Intergenic 


oxTi7 


P y tra r Vi mm o *;nm 3 1 

1—/ A. LI CLyjlll UlUUoUUluJ 


K08D8 (a) 4234 


LGIV +66 


Tnterpenic 


oxTi8 


Extrachromosomal 


R09B5 @ 22,929 


LGV, -19.0 


Exon #6 of 
R09B5.12 


oxTi9 


Extrachromosomal 


Y69H2@ 39,771 


LGV, + 17.49 


Intron #5 of 
Y69H2.4 


oxTilO 


Extrachromosomal 


Y47C4A 


LGX, -20 


Repeat 


oxTill 


Extrachromosomal 


C34E11 @ 12,022 


LGX, +6.55 


Exon #10 of 
C34E11.1 


oxTil2 


Integrated array 


Y71A12B @ 50,370 


LGI, +21 


Intergenic 


oxTil3 


Integrated array 


Y48G1C(S). 19,916 


LGI, - 19.8 


Intergenic 


oxTil4 


Integrated array 


C17F4@ 22,793 


LGII, -8.06 \ 


Exon #18 of 
gey- 19 


oxTil5 


Integrated array 


F35C5 @ 5,735 


LGII, +10 


Intergenic 


oxTil6 


Integrated array 


C06B3 @ 14,747 


LGV, +5.79 


Intergenic 


oxTi!7 i 


Integrated array 


R01H2@ 20,193 | 


LGIII, - 0.86 


Intergenic 



25 

Comparison of the insertion site sequences did not reveal a strong consensus motif 
for the target DNA. Molecular analysis of the insertions therefore demonstrated that Mosl 
insertion obeyed properties previously observed for mariner class transposons. However, 
a formal possibility remained that Mosl hopped into the genomic DNA present in one of 
30 the extrachromosomal arrays and that recombination occurred subsequently between the 
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array and the genome. To rule out this possibility, the insertion oxTi4 which was 
positioned 35 kb away from snt-1 was genetically mapped. In agreement with this 
physical location, oxTi4 was mapped less than 2.5 map units from snt-1: 20 Snt-1 
individuals were cloned from the self-progeny of oxTi4lsnt-l hermaphrodites. None of the 
mutants segregated oxTi4. The presence of oxTi4 was determined by PCR using one 
5 Mosl primer pointing towards the right end of the transposon (o JL89 (SEQ ID NO: 5)) 
and one primer in the genome (oJL129 5'-CCAAATGCGTCTGTCCCACTC (SEQ ID 
NO: 13)). A PCR positive control was performed on each DNA sample using cha-1 
primers. 

Example 11 - Remobilization of a Genomic Transposon Insertion 

10 The transposition events documented above were all excisions from an array of 

transposons residing in Drosophila DNA. To determine whether the transposase acts on a 
single Mosl transposon in a C. elegans chromosome, the oxTi4 insert was remobilized. 
Primers for PCR were designed flanking the oxTi4 insertion. A first PCR round was 
performed with primers located 1671 nt upstream and 3144 bp downstream to oxTi4 

15 (respectively oJL149 5'- AAGTATGGCCAAACGACCCGACAC (SEQ ID NO: 14) and 
oJL150 5'- GCATTGGCACCTTTCTCCCTTCT (SEQ ID NO: 15)). A second round was 
performed using primers 493 bp upstream and 913 downstream to oxTi4 (respectively 
0JL145 5'- ACAGGCAGCATTTTGTAGTCT (SEQ ID NO: 16) and oJL148 5'- 
AGGCTGCCTCGTAAGTTCCTACAG (SEQ ID NO: 17)). Short PCR products were gel 

20 purified, subcloned and sequenced. The transposase-expressing transgene 

{oxExl 67 [glh-2: Transposase]) was crossed into animals homozygous for the oxTi4 
insertion and DNAs from the progeny were analyzed for amplified fragments shorter than 
the insertion. These shorter PCR products represented a variety of excision events, 
including the three nucleotide excision footprint previously characterized for Mosl 

25 excisions (G. Bryan, D. Garza, D. Haiti, Genetics 125, 103 (1990)), as well as smaller 

footprints, excisions and even incomplete excisions (Table 3). Since these products could 
arise from excision events in somatic cells, progeny animals that lost the transposase 
expression array were analyzed. Pools of 1 5 individuals from oxTi4; 
oxExl 67 [glh-2 . Transposase] progeny that lost the transposase array were transferred to 

30 fresh plates and allowed to lay eggs for 24 hours. Adult worms were then analyzed by a 
single round of PCR using the primers oJL145 (SEQ ID NO: 16)-oJL148 (SEQ ID NO: 
17). Sixty individuals were cloned from the progeny of the pool exhibiting short PCR 
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product and analyzed at the next generation to identify clones that lost oxTi4. One animal 
was identified among 954 progeny in which excision of the transposon had occurred. In 
this animal the excision left a 3 bp footprint and the duplicated TA dinucleotide which 
together resulted in a +2 frameshift. These data indicate that single copies of the Mosl 
Drosophila transposon can excise from C. elegans DNA in the germline to introduce 
5 frameshift or deletion mutations at the transposon insertion site. 

Table 3. Lesions generated by excision of the oxTi4 insert. The extrachromosomal 
[glh-2:Transposase] transgene was crossed into animals homozygous for the oxTi4 
insertion. PCR was used to analyze the oxTi4 insertion site after the loss of Mos 1. Pools 

10 of 15 individuals from oxTi4; oxExl67[glh-2:Transposase] progeny that lost the 

transposase array were transferred to fresh plates and allowed to lay eggs for 24 hours. 
Adult worms were then analyzed by a single round of PCR using the primers oJL145 
(SEQ ID NO: 16)-oJL148 (SEQ ID NO: 17). Sixty individuals were cloned from the 
progeny of the pool exhibiting short PCR product and analyze at the next generation to 

15 identify clones that lost oxTi4. Top line: sequence of oxTi4. Lower case: Mosl sequence. 
Upper case: genomic sequence. Bold: TA dinucleotide duplicated during Mosl insertion. 
Bottom lines: excision products. Dash: deleted base pairs. The insertion (bottom line, 
italic letters) corresponds to an internal fragment of Mosl (nt 147 to 178). 

20 

CTCTTTTCCAGACGAGTAccaggtgtac tacacctgaTATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGTA TATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGTA aTATATCCTTTTGTTCCTT 

25 CTCTTTTCCAGACGAGTA tgaTATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGTAc TATATCCTTTTGTTCCTT 

249 bp deletion — tgaTATATCCTTTTGTTCCTT 

CTCTTTTCCAGACGAGa 143 bp deletion 

CTCTTTTCCAGACGAGTA 188 bp deletion 

30 463 bp deletion 

CTCTTTTCCAGACGAGTA attgtttactctcagtgcagtcaacatRtcgaTATCCTTTTGTTCCTT 
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Example 12: - Engineering Mutations in the C. elegans Genome by Transgene 
Instructed DNA Double Strand Break Repair Following Mosl Excision 
Germline expression of the Mos transposase under the control of the glh-2 
promoter causes reexcision of single copies of Mosl inserted in the C. elegans genome. 
Immobilization of the transposon causes a DNA double strand break (DSB) at the site of 
5 excision which is repaired by the cellular machinery. In 1992, R. Plasterk and J. Groenen 
(EMBOJ. 1 1 :287) demonstrated that a DSB caused by excision of a Tel transposon in a 
mut-6(st702) background can be repaired using DNA contained in an extrachromosomal 
array that carries sequences homologous to the region of excision. As a result, sequences 
flanking the break can be replaced by sequences contained in the transgene. This strategy 

10 provides a way to engineer mutations in the genome. However, this approach never 

became a routine strategy probably because transposition is not controlled and excision 
occurs at low rates in such mutant strains. 

The controlled transposition of Mosl provides an efficient tool to use this strategy 
for engineering of the C. elegans genome: after a Mosl insertion has been identified in 

15 the gene of interest, a transgene is constructed with mutated sequences homologous to the 
region of insertion. The transgene that carries the glh-2: :MosTransposase expression 
vector is crossed into the.starin that contain the Mosl genomic insertion and the template 
transgene. Expression of Mos transposase causes Mosl excision and the progeny is 
screened by PCR for transgene instructed repair at the excision site (Figure 8). 

20 The feasibility of regulated mobilization of a heterologous transposon in the C 

elegans germline was thus demonstrated. The characteristics of Mosl transposition 
suggest that it could be used as a technique for tagging mutant genes. First, the Mos 
transposase does not activate transposition of endogenous transposons. Second, 
transposition of Mos in the germline is strictly dependent on the expression of the 

25 transposase. In this respect the use of a heat-shock promoter to express the transposase is 
of particular interest since it provides a convenient way to turn transposition on and off 
and to stabilize new inserts. Third, insertion sites of Mosl in the genome do not exhibit 
strong sequence bias. Transposons were inserted into exons, introns and intergenic 
regions. Comparison of the insertion sites did not reveal a strong consensus sequence 

30 apart from the TA dinucleotide. Fourth, excision and insertion frequencies can be 

differentially manipulated by expressing the transposase under the control of different 
promoters. The heatshock promoter caused very low rates of excision and loss of the 

33 



WO 00/73510 



PCT/US00/40091 



transposon array but high rates of transposon insertion. The glh-2 promoter construct 
caused a low rate of insertion but a high rate of excision and loss of the transposon array. 
Since transposon insertions frequently do not disrupt gene function in C. elegans even if 
the insertion occurs in an exon (A. M. Rushforth, B. Saari, P. Anderson, Mol Cell Biol. 
13, 902 (1993); A. M. Rushforth, P. Anderson, Mol. Cell Biol 16, 422 (1996)), 
transposons are usually remobilized to generate deletion alleles (D. Eide, P. Anderson, 
Mol Cell Biol 8, 737 (1988); R. R. Zwaal, A. Broeks, J. van Meurs, J. T. Groenen, R. H. 
Plasterk, Proc, Natl Acad. Sci. USA9Q, 7431 (1993)). It was thus demonstrated that the 
glh-2 expression construct can be used to generate deletion alleles of the genes containing 
Mosl insertions. 

Mosl transposition in C. elegans will allow the development of two new genetic 
tools. First, mutations identified in forward screens using Mosl will allow the rapid 
cloning of the mutated gene. Second, a library of insertions localized in the genome could 
be generated; the glh-2 expression construct could then be used to remobilize these 
insertions at high frequency and generate deletion and frameshift mutations in genes of 
interest. 
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CLAIMS: 

1 . A method of regulated expression of a heterologous gene in cells of the germline 
of C. elegans comprising the steps of: 

a. inserting a transgene construct into the C. elegans, wherein the construct 
comprises the heterologous gene operably linked to a promoter and a 3' 

5 untranslated region of a gene that is expressed in the G elegans germline, 

wherein the promoter is an inducible promoter or a germline-specific 
promoter; and 

b. expressing the heterologous gene. 

2. The method of Claim 1 , wherein the promoter is inducible. 

10 3 . The method of Claim 2, wherein the promoter comprises a heat-shock promoter. 

4. The method of Claim 2, wherein the promoter comprises a tetracycline-regulated 
promoter. 

5. The method of Claim 1, wherein the construct is substantially free of bacterial 
plasmid sequences. 

1 5 6. The method of Claim 1 , wherein the construct is substantially free of repeated 
DNA sequences. 

7. The method of Claim 1, wherein the 3' untranslated region comprises a glh-2 3' 
untranslated region. 

8. The method of Claim 1 , wherein the promoter comprises a glh-2 promoter. 
20 9. A transgene construct for expression in C elegans comprising a heterologous 

gene operably linked to a promoter and a 3' untranslated region of a gene that is 
expressed in the C elegans germline, wherein the promoter is an inducible 
promoter or a germline-specific promoter. 
10. The transgene construct of Claim 9, wherein the promoter is inducible. 
25 11. The transgene construct of Claim 1 0, wherein the promoter comprises a heat- 
shock promoter. 

12. The transgene construct of Claim 10, wherein the promoter comprises a 
tetracycline-regulated promoter. 

13. The transgene construct of Claim 9, wherein the construct is substantially free of 
30 bacterial plasmid sequences. 

14. The transgene construct of Claim 9, wherein the construct is substantially free of 
repeated DNA sequences. 
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15. The transgene construct of Claim 9, wherein the 3' untranslated region comprises 
diglh-2 3' untranslated region. 

1 6. The transgene construct of Claim 15, wherein the promoter comprises a heat- 
shock promoter. 

1 7. The transgene construct of Claim 9, wherein the promoter comprises a glh-2 
5 promoter. 

1 8. The transgene construct of Claim 9, wherein the heterologous gene is a 
transposase. 

19. The transgene construct of Claim 9, wherein the heterologous gene is a TC3A 
transposase gene. 

10 20. A method of transposon-mediated mutagenesis in a C. elegans genome, 
comprising the steps of: 

a. introducing a transgene construct into the C. elegans genome, wherein the 
construct comprises a transposase gene which is operably linked to a 
regulable expression control element and a 3' untranslated region of a gene 

15 that is expressed in the C. elegans germline; and 

b. expressing the transposase gene, such that a transposon in the C elegans 
genome transposes, causing a mutation. 

21 . The method of Claim 20, wherein the transposons comprise endogenous 
transposons. 

20 22. The method of Claim 2 1 , wherein the transposons comprise Tc3 transposons. 

23. The method of Claim 20, wherein the transposase gene is a TC3A transposase 
gene. 

24. The method of Claim 22, wherein the transposase gene is a TC3A transposase 
gene. 

25 25. The method of Claim 21, wherein the regulable expression control element is an 
inducible promoter. 

26. The method of Claim 26, wherein the promoter comprises a heat-shock promoter. 

27. The method of Claim 25, wherein the promoter comprises a tetracycline-regulated 
promoter. 

30 28. The method of Claim 20, wherein the construct is substantially free of bacterial 
plasmid DNA sequences. 
29. The method of Claim 20, wherein the construct is substantially free of repeated 
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DNA sequences. 

30. The method of Claim 20, wherein the 3' untranslated region comprises a glh-2 V 
untranslated region. 

3 1 . The method of Claim 30, wherein the regulable expression control element 
5 comprises a heat-shock promoter. 

32. The method of Claim 30, wherein the regulable expression control element 
comprises a, glh-2 promoter. 

33. The method of Claim 20, further comprising introduction of one or more 
additional copies of an endogenous transposon into the C. elegans germline. 

10 34. The method of Claim 33, wherein the endogenous transposon is a Tc3 transposon. 

35. The method of Claim 20, wherein the transposons comprise heterologous 
transposons. 

36. The method of Claim 35, wherein the heterologous transposons are introduced 
into the C. elegans genome: 

1 5 37. The method of Claim 35, wherein the transposons comprise Mos 1 transposons. 

38. The method of Claim 35, wherein the transposase gene comprises restriction sites 
5' of the start codon, restriction sites 5' of the stop codon, and an artificial intron 
in the transposase gene open reading frame. 

39. The method of Claim 35, wherein the regulable expression control element is an 
20 inducible promoter. 

40. The method of Claim 39, wherein the promoter comprises a heat-shock promoter. 

41 . The method of Claim 39, wherein the promoter comprises a tetracycline-regulated 
promoter. 

42. The method of Claim 35, wherein the construct is substantially free of bacterial 
25 plasmid DNA sequences. 

43. The method of Claim 35, wherein the construct is substantially free of repeated 
DNA sequences. 

44. The method of Claim 35, wherein the 3' untranslated region comprises a glh-2 3' 
untranslated region. 

30 45. The method of Claim 44, wherein the regulable expression control element 
comprises a heat-shock promoter. 
46. The method of Claim 44, wherein the regulable expression control element 
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comprises a glh-2 promoter. 

47. A method of introducing a heterologous DNA sequence into a C. elegans 
chromosome comprising the steps of: 

a. introducing a transposon into the C. elegans, wherein the transposon 
comprises the heterologous DNA sequence; 

b. introducing a transgene construct into the C. elegans, wherein the 
construct comprises a transposase gene which is operably linked to a 
promoter and a 3 5 untranslated region of a gene that is expressed in the C. 
elegans germline; and 

c. expressing the transposase, such that the transposase integrates as a single 
copy into a C elegans chromosome. 

48. The method of Claim 47, wherein the heterologous DNA sequence comprises a 
bacterial plasmid DNA sequence. 

49. The method of Claim 47, wherein the gene carried on the transposon is useful for 
selection or screening purposes. 

50. The method of Claim 47, wherein the transposon contains FRT/FLP 
recombination sites. 

5 1 . The method of Claim 47, wherein the promoter is inducible. 

52. The method of Claim 47, wherein the promoter comprises a heat-shock promoter. 

53. The method of Claim 47, wherein all bacterial plasmid sequences have been 
removed from the construct. 

54. The method of Claim 47, wherein the construct is substantially free of repeated 
DNA sequences. 

55. The method of Claim 47, wherein the 3' untranslated region comprises a glh-2 3' 
untranslated region. 

56. The method of Claim 47, wherein the promoter comprises a glh-2 promoter. 
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(MosJ left end) 

CAGTCAAGGTTGACACTTACAAGGTCAAAGTTTTATGACAATCGATAAATATTTACGTT 
TGCGAGACATCTATATGTTCGAACCGACATTCCCTACTTGTACACCTGGtaaatgaaag 
ctggtgacgtggagattacgtccccgtaaaaattattgcgaaatatgcaacggtggccg 
agaaaatccgcgaccccgtcgacccagacacggttgattctccagtgacggtcgatcAA 
CAAAAAAGATCCATTTTTCATCTCCAGTAACGATACGATGCAAAAACGACTTCCTTTTG 
TATCGTGAAAGCAAAATTTCGCATGTGTTTTTGCGCCTCTCCATCTGCCTCT 

BsmJl Sequence of an inverse PCR product. Nucleotides in capital letters are 
from the Mosl transposon. C. elegans flanking genomic region is in lower case. It 
matches the Y47C4.Contig215 sequence from chromosome X available at the 
Sanger Centre. 
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chromosome 

2 

transgene 




Figure 8: Knock-in strategy: Mosl excision causes a DNA double strand break (1). A transgene 
containing sequences homologous to the excision region pairs with the chromosome (2). 
Mutation contained in the transgene is copied into the chromosome (3). 
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